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Abstract — In this paper it is shown that for substantial increase 
of video quality in wireless network it is necessary to execute 
two obligatory points on modernization of the conununication 
scheme. The player on the received part should throw back 
automatically duplicated RTP packets, server of streaming video 
should duplicate the packets containing the information of key 
frames. Coefficients of the mathematical model describing video 
quality in wireless network have been found for WiFi and 
3G standards and codecs MPEG-2 and MPEG-4 (DivX). The 
special experimental technique which has allowed collecting and 
processing the data has been developed for calculation of values 
of factors. 
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I. Introduction 

Mobile solutions in the telecommunications area are being 
increasingly adopted in everyday life. Modem cellular tele- 
phones possess the same capabilities of a traditional computer 
Laptops and netbooks provide a high degree of mobility. 
Owing to their compactness and presence of 3G, WiFi, and 
WiMAX network adapters, we are experiencing pervasive 
communications as and when needed. According to data Cisco 
VNl 1 4 1, the Internet traffic on wireless devices has annual 
growth of more than 250%. By 2013 the volume of such traffic 
will increase by 66 times in comparison with 2008 and will 
make up 5,4% of all IP-traffic in the Internet. By 2013 various 
video content will make 64% of all traffic of wireless networks 
in the world. 

There is serious obstacle for daily use of video services in 
wireless networks: the corresponding communication quality 
is not satisfactory. High degradation levels of packet losses and 
delay variation (network jitter) in wireless networks affect data 
transmission in real time modes of communication such as the 
multimedia traffic, and make them susceptible to distortions in 



user experience. Packets of streaming video experience drastic 
changes in sequence numbers or are lost during transfers on 
network paths because of considerable delay variation. At 
the receiver-side video image, distortions manifest as plural 
artifacts, lack of lip synchronization, and sometimes frame 
freezing. 

The technique of certification of packet networks in recom- 
mendation RFC-2544 lIJl defines the following key parameters 
of network quality: available bandwidth, packet delay, network 
jitter, number of the lost packets, quantity of packages with 
errors. 

The idea of subjective testing (MOS) consists that video 
received after transfer on network, is shown to a commission of 
experts who put down scores, being based on the impressions 
of quality. The initial clip is coded by one of codecs MPEG-2, 
MPEG-4 or Windows Media Video 9 and is passed through 
a wireless network supporting standards such as WiFi, 3G 
or WiMAX. There are many methods of demonstration of 
sequences and gathering of the estimations, some of them are 
described in recommendations ITU |l6l. Unfortunately, they are 
calculated based on video comparison in broadcasting format, 
and aren't so convenient for testing carrying out on PCs. 

In the present paper the problem of adaptation of modern 
coding algorithms and transfer of video for wireless net- 
works, such as 3G, WiFi and WiMAX, and also for other 
networks with distorted quality characteristics is considered. 
In papers fSl, fSl it has been shown that subjective scope of 
quality of video as well as network parameters, has gradation 
Good, Acceptable and Poor (GAP). Such description helps 
to understand the qualitative dependences arising at network 
translation of video and to make primary numerical approach. 

In the present paper, we attempt to find numerical depen- 
dence of video quality on network parameters. Distinctive 
feature of our approach is that the specified dependence is 
described by a simple mathematical model that allows us to 
compare numerical values of coefficients. On the basis of 
similar comparison, we find not only the most essential factors 
influencing quality of video, but also compare them for various 
codecs. 



In paper, another contribution is the characterization of 
distortions which damage the "key" frames as well as the 
"usual" frames is considered. Key frame is the frame which 
bears in itself the full information on the video image and it 
can be restored without reinforcement with the additional data. 
Usual frame is one that codes difference between the previous 
frame and current one. Degree of key frame compression is 
less than at usual shot, and the size several times is more. 
In this paper numerical comparison of influence of the errors 
which have occurred on key frames, for codecs MPEG-2 and 
MPEG-4 are given. 

This paper is organized as follows: Section 1 describes 
about premises for mathematical modeling of dependence of 
video quality from characteristics of network connection, in 
Section 2 planning of experiments is explained, and results of 
processing of experiments are presented in Section 3, Section 
4 tells about numerical results of experiment and parameters 
of mathematical model. 

II. Premises for modeling 

The communication quality of video transmission wors- 
ens depending on characteristics of network connection. In 
order to describe quality of transferred video depending on 
network parameters the universal function Qmos{PtJ7 D, B), 
describing video quality on a MOS scale has been considered. 
This function can be expanded in a power series on network 
parameters, thus it is possible to be limited to linear members. 

For the fixed speeds of video stream it is enough to consider 
only a linear dependence from two parameters (losses of 
packets and network jitter) [TJ: 



ImOS = Qideal - ap- /3j, 



(1) 



where 



• Q ideal " maximum quahty of video for the given codec, 
points from zero to five; 

• p - packet loss, %; 

• i - network jitter (delay variation), sec; 

• Qmos - video quality on received side, points from zero 
to five; 

• a, (3 - coefficients of model which can be found experi- 
mentally. 

The uniform video sequence which was compressed by 
codecs MPEG-4 (DivX), MPEG-2 and Windows Media video 
9 with constant bitrate 256 Kbps has been developed for 
experimental tests. 

The basic aim put in the present research is revealing of 
influence of key frames on quality of received video. Therefore 
coefficients a'' and (3'' for a stream with damage of key frames, 
and also a™ and /3™ for sequences without damage of key 
frames are separately calculated. For example, ct'^^yx ^i^^ 
designate coefficient which characterizes deterioration of video 
encoded by MPEG-4 (DivX) if a key frame was damaged. 
Further in paper coefficients a|)j^,x ^^'^ '^DivX' ^^ ^^^^ ^^ 
'^Mpeg2 '^Mpeg2' ^^^^ ^^ found and Compared to define in 
numerical sort a measure of video deterioration. 



III. Experiment planning 

For finding of coefficients from the Equation (1) we had 
been developed and spent a number of experiments. Videos 
files encoded by MPEG-4 (DivX), MPEG-2 and Windows 
Media Video 9 were transferred on a notebook connected to 
wireless network WiFi, WiMAX or 3G. On a notebook record 
of received video was spent to a file, the network traffic at 
packet level was in a parallel way written by network sniffer 
Wireshark. Thus, under the received video it is possible to 
find video quality on scale MOS, and parameters of network 
connection may be calculated on network logs. 

The software was applied to carrying out and the analysis 
of experiments (see Fig. [T]): 

1) VLC media player |I9] was used as a program video 
server and a video player with possibility of record of 
video received on a network in a file on a notebook. 

2) The Wireshark Network Protocol Analyzer ifTTI . with its 
help on a notebook all network traffic was written, and 
then analyzed. 

3) VirtualDub ifTOl . by means of this program the frame 
analysis of the video was made for calculation MOS. 

4) AviSynth 2.5 |1|, with its help video encoded WMV 
was parsed in program VirtualDub. The given codec 
works using technology DirectShow, instead of VFW 
(Video For Windows), and cannot be opened directly in 
VirtualDub. 

All video fragments written during experiments and network 
logs are published on site of Internet TV ltd lE). 

Experiments will look equally for networks Wi-Fi, WiMAX 
and 3G, only network equipment varies. In the Figure [T] the 
scheme of experiment is presented. 

Alone video clip with various types of the picture has been 
prepared for experiment: static, with weak movement, with 
fast movement, with brightness change. Then this video clip 
has been encoded with usage of MPEG-4 (DivX), MPEG-2 
and Windows Media Video 9. Following parameters of video 
are installed: 

• Video Resolution 320 x 240 pixels 

• Frame Rate - 24 Frames per second 
. Bitrate - 256 Kbps 

• Maximal quality 

Files of initial video are also published on site of Internet 
TV ltd |8|. 

Software product Videolan ||9l was used as the program for 
network video translation. This program was started from a 
server connected to an Internet with real IP address. 

VLC media player and network sniffer Wireshark writing 
network traffic have been installed on the computer which 
accepted video. VLC media player has been customized on 
simultaneous demonstration of accepted video on the screen 
and its saving in a video file. Thus, at carrying out of each 
experiment two files for the further analysis are written: a 
video file and a file of network logs. 

For carrying out of experiments we were used LAN of 
Samara State Aerospace University (WiFi), the Samara seg- 
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Fig. 1 . Sclieme of experiment 



ments of all-Russian operators: the Megaphone (3G), a Beeline 
(3G) and Metromaks (WiMAX) which have free of charge 
given the equipment and technical possibilities for test trials. 

IV. Processing of results of experiments 

For a network of Wi-Fi standard periodic deterioration 
of network, accordingly the picture is characteristic worsens 
during the corresponding moments. The order of the analysis 
of all received video fragments and network logs is identical. 
Further process of the analysis of video files and a correspond- 
ing network logs during the moments of video deterioration 
is described: 

1) The Program VirtualDub opens the recorded video file. 
The frame previous the distorted frame is searched, in 
the Figure |2] it is a frame 143 which is displayed on 
5923's millisecond ms of a video series. Similarly a 
last distorted frame is searched, in a described case it is 
IVl's frame shown on 7083 millisecond. Thus, duration 
of distortion consists of 28 frames or 1160 ms. 

2) In the network sniffer Wireshark the corresponding logs 
( |http://stre am. ip4tv.r u7wirelessAViFi/testl/ldivx.pcap| ) 
are loaded. For RTP packets Wireshark has the built in 
analyzer, the necessary stream is selected from it for 
more detailed analysis. In the analyzer (the Figure [3]l 
is resulted the list of all packets and red places where 
packages have been lost are flagged. Also the statistical 
data on an inter-packet interval and jitter is specified. 
From column Sequence it is visible that are accepted 
a package with number 30195 and a package with 
number 30198, and between them two packages have 
been lost. 

3) The main problem of the analysis - how to correlate 
a file of network logs with video recording. For this 
purpose in a network logs packet (226, see Figure Hli is 
searched after which there was an error In relation to 
the beginning of record of dens this package has arrived 
on 10,87 second. In item 1 it has been found that in 
a video distortion has occurred on 143 frame which 
was displayed on 5923 ms of a video series. Thus, the 
required relation is defined. 



4) For reception of authentic statistical calculations it was 
accepted that the length of sequence for the analysis 
should be multiple to 100 packets. Now it is not difficult 
to count up percent of the lost packages. 

5) In Analyzer RTP the inter-packet interval and network 
jitter also is specified. 

Other major task arising at processing of experimental 
results is an estimation of quality of video on scale MOS. 
Algorithm of estimation the following: 

1) Labels in program VirtualDub (the Figure |2]i are in- 
stalled on the first distorted frame of video and last one. 

2) The distorted video can be viewed some times and to 
compare it to the original. 

3) Quality of the video at the moment of an error on scale 
MOS from 1 to 5 is visually estimated. 

In the present paper for each error the estimation was 
exposed by 4 experts. Then by their estimations arithmetic 
mean value was calculated. 

At processing of the files encoded by WMV9 codec there 
were complexities. Unfortunately, program VirtualDub does 
not display key and not key frames for video encoded by the 
codec of Windows Media Video 9, therefore experiments with 
it us have not been handled. 

V. Analyzing of results of experiments 

Obtained data has been handled by the technique described 
in the previous section. All errors, both at video level, and 
on a network layer have been parsed. Subjective quality of 
video Qmos depending on packet loss p and network jitter j 
is found. The received values of coefficients are gathered in 
Tables D and nil 

It has been found that quality of the video depends not 
only on percent of packet losses and network jitter, but also 
from frame type on which there was an error. Key frame 
carries in itself the complete information on the video and 
can be restored without engaging of the additional data, and 
usual one is a frame which encodes a difference between the 
previous frame and flowing. Accordingly, if the error damages 
a key frame quality of the video worsens more strongly in 
comparison with a similar error in a usual frame. Therefore 
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Fig. 2. Seai'ch of distortion frames 
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Analysing stream From 192.168.2D5.21 port lUS to 192. 168.207.200 port 1234 SSRC - Dx2E0C829D 



Packet 


*■ Sequence 


< Delta(tr>s) 


< Filtered :itter(nis) 


■t skemfms) 


< IPBW(kbps) 4 


Marker 


< Status < 


220 


301B5 


73,03 


13,25 


944,84 


292,90 


SET 


[Ok] 


221 


30190 


12,38 


13,77 


966,42 


303,74 


SET 


[Ok] 


222 


30191 


14,34 


14.11 


985,53 


303,74 


SET 


[Ok] 


223 


30192 


33,05 


13,34 


987,37 


303,74 


SET 


[Ok] 


224 


30193 


45,76 


12,83 


982,17 


303,74 


SET 


[Ok] 


225 


30194 


61,09 


13,31 


961,62 


292,90 


SET 


[Ok] 


226 


30195 


23,64 


13,54 


978,54 


303,74 


SET 


[Ok] 


227 


30198 


326,14 


25,6? 


770,92 


206,11 


SET 


Wrong sequence nr 


226 


30199 


25,26 


24,82 


733,09 


216,96 


SET 


[Ok] 


229 


30200 


25,00 


24,05 


795,53 


216,96 


SET 


[Ok] 


230 


30201 


12,61 


24,10 


820,35 


227,81 


SET 


[Ok] 


231 


30202 


12,38 


24,16 


845,41 


238,66 


SET 


[Ok] 


232 


30203 


24,98 


23,40 


8S7,45 


238,66 


SET 


[Ok] 


233 


30204 


12,38 


23,45 


831,57 
Id 


249,50 


SET 


[Ok] 



Max delira = 0,00 ms at packet no, 

Max jitter = OjOO ms. Mean jitter = 0,00 ns. 

Max skei\i = -1216^93 ms. 

Total RTP packets = 3367 (expected 3367) Lost RTP packets = 104 (3,09%) Sequence errors = 6 

Duration 118,53 5(-66m5 clock drift, corresponding fa S99S0H? (-0,06%) 



Savepayload.. 



Jump to 



Graph 



Fig. 3. EiTor of RTP packet sequence 



N 


Codec 


Qideal 


„fc 


f}^ 


Q™ 


/3^ 


1 

2 


MPEG-2 
DivX 


4.2±0.2 

4.7±0.2 


0.15±0.03 
0.27±0.05 


0.011±0.002 
O.013±0.O03 


0.04±0.01 
0.13±0.02 


0.003±0.001 
0.01±0.002 



TABLE I 

Values of coefficients of model for codecs MPEG-2, DivX in WiFi networks 



N 


Codec 


Qideal 


«fc 


pk 


a"' 


^w 


1 
2 


MPEG-2 
DivX 


4.2±0.2 

4.7±0.2 


0.005±0.002 
0.01±0.003 


0.005±0.002 
0.003±0.001 


0.004±0.001 
O.002±0.0005 


0.003±0.001 
().002±().0008 



TABLE II 

Values of coefficients of model for codecs MPEG-2, DivX in 3G networks 
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Tiffie 

223 10.748433 

224 10.794189 

225 10.855277 



Source 

OTS 3602.254166666 
OTS 3602.295S77777 
DTS 3602,337588383 



DTS 3602.379288888 



Destination 

PTS 3602.254166666 
PTS 3602.295877777 
PTS 3602.337583883 



PTS 3502.379288888 



227 11, 

228 11. 

229 11. 

230 11. 

231 11. 

232 11. 

233 11. 

234 11. 

236 11. 

237 11. 

238 11. 

239 11. 

240 11. 

241 11. 

242 11, 

243 11, 



205D5S 
230319 

255318 
267929 
280305 
30S282 
317560 
334679 
373 546 
59060 
471706 
484363 
496918 
502268 
541773 
584186 




. S 04 411111 
. 546122222 

.587833333 

.629544444 

.671255555 

205.21 

.712955555 

. 754666666 

.796377777 

.S3S038S8S 

.379738388 

.921SQ0000 

.004922222 

.046623 223 

205.21 

.130044444 



PTS 3602 
PTS 3602 
PT5 3602 
PTS 3602 
PTS 3602 
192.163 
PTS 3602 
PTS 3602 
PTS 3602 
PTS 3602 
PTS 3602 
PTS 3602 
PTS 3603 
PTS 3603 
192.163 
PTS 3603 



504411111 

546122222 

587333333 

.639544444 

.67125555! 
207.200 

712955555 
.754666666 

796377777 



379788388 
921500000 
004922222 
046633233 

207.200 

.130044444 



Protocol Info 

KIH HI-MHl 

MPEG PES 

MPEG PES 

MPEG PES 

MPEG PES 

MPEG PES 

MPEG PES 

MPEG PES 

MPEG PES 
RTF PT^MPE 

MPEG PES 

MPEG PES 

MPEG PES 

MPEG PES 

MPEG PES 

MPEG PES 

MPEG PES 

MPEG PES 
RTP PT=MP[ 

MPEG PES 



1= Frame 22 6 (1370 byte^ on wire, 1370 bytes captured^ 

> Ethernet II, src: AniedTe_13 ; 7c: ec (;O0:aO:d2 :13 :7c [ec), D5t : A5keyCom_31:4d:fl (00:21:63:3 
t> Intarnst Protocol, Src: 1B2.168. 205. 21 (192.168.205.213, Ost : 192.168.207.200 (192.168.207. 

> user Da taqraiii Protocol, src Port: ardus-trns (1 1153, Dst Port: s earch-agent C1234") '' 



0000 00 21 63 31 4d fl 00 dO d2 13 7c ec OS 00 4 5 00 ,!clM. 

0010 03 4c e7 54 00 00 7f 11 31 Id CO a8 cd 15 CO a8 .L.T.. 

0020 cf C8 04 5b 04 d2 05 38 da le 80 al 75 f3 13 53 ...[.. 

0030 87 cl 2e Oc 82 9d 47 40 44 14 00 00 01 eO 04 a? 

0040 SO CO Oa 31 4d 4d 3a fl 11 4d 4d 3a fl 00 00 01 . . . IMH 

9 File: "K:\TeCT l\DivX.pcap" 4435 KB 00; 02:48 
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Fig. 4. Network logs 



in Tables Ul M it is specially selected two types of coefficients 
- with losses on key frames and without them. 

In Tables |l] |ll] a'' and /3'^ are coefficients of model with 
losses of packets on key frames, a™ and f3^ are coefficients 
for the intact key frames, and Qideai is an estimation on scale 
MOS for a source file (before transfer on a network). 

Initially for the video encoded by MPEG-4 (DivX), quality 
is above, than for video encoded MPEG-2, but at deterioration 
of characteristics of a network quality decreases considerable 
and at the big network interferences becomes similar to quality 
MPEG-2. In case of damage of a key frame video quality 
encoded MPEG-4 (DivX) will fall more than in 2 times at the 
same characteristics of a network, and MPEG-2 in 3.75 times. 

Thus, for substantial increase of video quality by trans- 
mission to a wireless network it is necessary to fulfill three 
mandatory items on upgrade of the circuit of link: 

1) The server of stream video should duplicate the packets 
containing the information of key frames 

2) To upgrade a player on the receiving side automatically 
to throw back duplicated RTP packets 

3) The period between key frames cannot exceed 2 seconds 
(optimally 1 second) 

These simple measures will lead to what even poor 
(GAP) 13] networks will translate video with estimation more 
then 3.5. It is necessary to notice that for codecs MPEG-4 
(DivX) and MPEG-2 the transferred size of the traffic will 
increase on 7-10%, and quality of video grows not less than 
in 2 times. Some researched video players at playback MPEG- 
TS MPEG-4 streams already automatically discard repeating 
packets. 

The experiments spent on network WiMAX, have shown 



very good stability of the given network to link deterioration. 
WiMAX networks under the characteristics are comparable 
to fixed networks Ethernet. It is very difficult to find network 
errors since the percent of losses for all spent tests is near 0%, 
and a variation of delay of an order 20 msec even in the tests 
spent at the big competing traffic. According to paper [Tl such 
type of the traffic on system GAP concerns to good (Good). 
3G networks are very sensitive to external interferences 
and even at good level of a signal there are considerable 
losses of packets (Poor) and a delay variation of an order 
40 msec (Acceptable). Also it has been found that for one of 
3G providers the equipment frequently duplicated outgoing 
packages that created on video very strong artefacts. New 
player is needed for eliminating the given interferences. 

VI. Conclusion 

There is an increasing demand of voice and video appli- 
cations on wireless devices due to the recent developments 
in smart phones and tablet PCs. To cater to this demand 
efficiently and reliably, it is important for the application 
developers and service providers to characterize and tune 
the performance of RTP streams that deliver the content. In 
addition, the area of video performance measurement is in its 
early stages, and developing effective techniques to measure 
video quality is vital. In this paper, we address both the above 
dimensions of requirements. We show scenarios where RTP 
packets are duplicated and evaluate how the video player 
should handle these duplicated packets. We also show a novel 
experimental technique to identify key frames in video, and 
evaluate how redundancy of those key frame packets can 
improve video quality. 



A significant contribution of our work is the development 
of a mathematical model for estimating video quality of 
codecs MPEG-2 and MPEG-4 (DivX) codecs used in open- 
source VLC on WiFi and 3G standards compliant wireless 
networks. Our model development has been done with due 
considerations given to the handling of duplicate packets, and 
adding redundancy to key video frames. 

The video quality measurement experimental technique and 
research findings presented in this paper have been imple- 
mented in our Internet video broadcasting efforts at SSAU, 
Togliatty branch SSAU and Internet TV service. Owing to 
these implementations, we have observed that our Internet 
video broadcasting service offering has become more auto- 
mated, predictable and our service has seen notably lowered 
operation costs. 
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Professor from University of Mexico for fruitful dialogues 
in which course the idea of this article has taken shape. 
Also it would be desirable to thank all collective of technical 
service RIPE ncc and especially Ruben van Staveren and 
Roman Kalyakin for constant assistance in comprehension of 
subtleties of a measuring infrastructure. We also would like to 
express the gratitude to the Wolfram Research corporation, 
which the first has marked our preprint and has given us 
licenses for the right of use of Mathematica. 
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